Supplementary Figure SI: Validation on simulated data 

We tested our effective model on simulated data from a Wright-Fisher 
population, using a forward-in-time simulation scheme (see Supplementary 
Text). We varied the effective rate of sweeps, V, keeping the other two 
parameters N and \i fixed, and inferred all three parameters. As can be seen, this 
inference works well, in particular with respect to estimating V (a), with some 
underestimation for large values. The mutation rate is slightly underestimated 
by our model (b), and the effective population size estimates show some degree 
of anti-correlation with increasing V[c). 
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Supplementary Figure S2: Estimating selection from simulations 

(a-c) We also tested the unlinked model under selection using simulations. Here 
we simulated data sets varying the selection coefficient, with no linked sweeps, 
V = 0. As can be seen, the inference of all three parameters s, N and \i works very 
well up to about 2N 0 s = 10, and gets unreliable beyond that value, d) We also 
tested the inference of the selection coefficient with and without linked sweeps if 
all three neutral parameters V, N and \i are fixed. This is relevant to our estimates 
from non neutral annotation classes, where we fixed these neutral parameters 
from synonymous sites. 
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Supplementary Figure S3: Fitting a model with selection to synonymous sites 
We tested a model without hitchhiking but selection on synonymous sites (see 
Supplementary Text), (a-c) Inferred parameters are highly unrealistic for that 
model, yielding a 10 times higher mutation rate and a 10 times lower effective 
population size than inferred from the background selection and linked 
adaptation model. Most importantly, inferred selection coefficients are 
unrealistically high, d) With these extreme parameters, the resulting probability 
for allele frequencies under selection (green) is in principle able to explain to 
some extent the distortion of the frequency spectrum in low recombining 
regions, falling in between the background selection and the linked adaptation 
model. 
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Supplementary Figure S4: Comparison with the generalized MK-test 

This figure shows a comparison of our estimates of the fraction of adaptive 
substitutions with estimates from the generalized McDonald-Kreitman test (MK 
test). For all annotation categories, we plot the predicted fraction of adaptive 
substitutions from the background selection (blue) and linked adaptation model 
(red), together with estimates from the generalized MK test as a function of the 
recombination bin index. 
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Supplementary Figure S5: Causes of substitutions in different annotation 
categories 

In (a) we show the data from Figure 5a, but stratified across different functional 
categories, (b) shows the analogous version of Figure 5b in the main text, but for 
the less functional categories. 




Supplementary Figure S6: Log Likelihood difference of the linked adaptation 
vs. background selection model 

This figure shows the difference in log likelihood between the linked adaptation 
and the background selection model. In most bins, the two models perform 
equally, but we find that in the lowest four bins, the linked adaptation model 
performs substantially better, with a total score difference of 14,149 ± 6,914, 
where the error estimate is obtained from bootstrapping. 
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